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Abstract 

Overlapping fragments of genomic RNA spanning 6963 nucleotides from 5' end of spike (S) protein gene to 3' end of nucleocapsid (N) 
protein gene of turkey coronavirus (TCoV) were amplified by reverse-transcription-polymerase chain reaction (RT-PCR). The primers were 
derived from the corresponding sequences of infectious bronchitis virus (IBV). The PCR products were cloned and sequenced and their nucleic 
acid structure and similarity to published sequences of other coronaviruses were analyzed. Sequencing and subsequent analysis revealed 9 
open reading frames (ORFs) representing the entire S protein gene, tricistronic gene 3, membrane (M) protein gene, bicistronic gene 5, and N 
protein gene in the order of 5'-3'. The overall nucleic acid structures of these encoding regions of TCoV were very similar to the homologous 
regions of IBV. The consensus transcription-regulating sequence (TRS) of IBV, CT(T/G)AACAA, was highly conserved in TCoV genome 
at the levels of nucleotide sequence and location in regarding to the initiation codon of individual genes. Pair-wise comparison of gene 3, 
M gene, gene 5, or N gene sequences with their counterparts of IBV revealed high levels (82.1-92.0%) of similarity. Phylogenetic analysis 
based on the deduced amino acid sequences of S, M, or N protein demonstrated that TCoV was clustered within the same genomic lineage 
as the IBV strains while all the other mammalian coronaviruses were grouped into separate clusters corresponding to antigenic groups I or II. 
There were substantial differences of S protein sequence between TCoV and IBV with only 33.8-33.9% of similarity. 

© 2004 Elsevier B.V. All rights reserved. 
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1. Introduction 

Turkey coronavirus (TCoV) was identified in the early 
1970s as the major causative agent of the most costly disease 
of turkey encountered in Minnesota between 1951 and 1971 
(Nagaraja and Pomeroy, 1997). Outbreaks of turkey poult 
enteritis associated with TCoV have caused severe econom¬ 
ical losses in the turkey industry in Indiana, North Carolina, 
and other states for the last several years. Although the eco¬ 
nomical importance of this disease has been recognized for 
decades, the organization of genomic structure of TCoV is 
poorly understood and reports regarding the relationships of 
TCoV with other coronaviruses remained controversial (Van 
Regenmortel et ah, 2000; Gonzalez et ah, 2003). 
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Coronaviruses are pleomorphic, enveloped spherical 
particles surrounded by a fringe of 20 nm long club- 
shaped spikes. The diameter of coronaviral particles are 
around 140-150 nm. The coronavirus genome is a positive 
single-stranded capped RNA with a polyadenylated 3' 
end. Complete genomic RNA sequences of coronaviruses 
has been determined for infectious bronchitis virus (IBV; 
27,569 nucleotides; Boursnell et ah, 1987), murine hepatitis 
virus (MHV; 31,092 nucleotides; Lee et ah, 1991), human 
coronavirus (HCoV) strain 229E (27,277 nucleotides; 
Herold et ah, 1993), and transmissible gastroenteritis virus 
(TGEV; 28,579 nucleotides; Eleouet et ah, 1995; Penzes 
et ah, 2001). The 5' two-thirds of the coronavirus genome, 
approximately 20 kb, consists of two overlapping open 
reading frames (ORFs) that encode non-structural proteins 
including the viral RNA-dependent RNA polymerase and 
proteases. Another one-third nucleotide sequences from 
3' end contain ORFs encode the major structural proteins: 
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Fig. 1. (A) Schematic representation of turkey coronavirus genomic RNA, showing locations of coding regions for the spike (S) protein, gene 3. membrane (M) 
protein, gene 5, and nucleocapsid (N) protein. (B) Schematic representation of the location of primers used in the polymerase chain reaction (PCR), along with 
the intervening sequences (fragments I—III, or IV) amplified by the PCR. The sequences of primers: S-cor, tgaaaactgaacaaaagacagact; AS-cor, ccaaacataccaaggc- 
cactt; AS-corF, aagtggccttggtatgtttgg; MIBVR, gttcacacttagcaagccactg; MIBV, taagctttcagtggcttgctaagtgtgaacc; NIBV, tggatccaccgctaccttcaaacttgggcgc; Nup, 
tcttttgccatggcaag c; Ndown, tactcaaagttcattctc. 


spike (S), membrane (M), and nucleocapsid (N) proteins in 
the order of 5'-3' along the genome, respectively. 

Turkey coronavirus was initially determined to be anti- 
genically distinct from all other coronaviruses based on anti¬ 
genic differences revealed by immunoelectron microscopy 
(Ritchie et al., 1973) and hemagglutination-inhibition (Dea 
et al., 1986). This unique antigenicity was questioned when 
the close relationship between TCoV and bovine coronavirus 
(BCoV) was demonstrated in a series of antigenic studies 
(Dea et al., 1990) and by sequence analysis of TCoV M and 
N genes (Verbeek and Tijssen, 1991). In contrast, recent anti¬ 
genic (Guy et al., 1997; Loa et al., 2000) and genomic (Breslin 
et al., 1999a,b; Akin et al., 2001; Cavanagh et al., 2001,2002; 
Lin et al., 2002) analysis of TCoV, however, demonstrated 
that TCoV and IBV, two avian coronaviruses, are closely re¬ 
lated. The causes for these discrepant results regarding the 
relationships of TCoV with BCoV or IBV remained unclear. 
Further analysis of genomic structure of TCoV is important 
to clarify this enigma. Thus, the purpose of the present study 
was to determine the sequences of the 3' end coding region 
for structural protein genes of TCoV. 

2. Materials and methods 

2.1. Turkey coronavirus 

The TCoV isolate (isolate 540) used in the present study 
were recovered from fecal contents and intestines of turkey 
poults with acute coronaviral enteritis in Indiana, US in 1994. 
The viruses were passaged 5 times in 22-day-old embryonat- 
ing turkey eggs. The presence of TCoV in the intestines of 
embryos were confirmed by TCoV-specific immunofluores¬ 
cence antibody assays and electron microscopy at the In¬ 
diana State Animal Disease Diagnostic Laboratory in West 
Lafayette, Indiana, US. 

2.2. RNA isolation and reverse transcription 

Total RNA was extracted from the intestines and intestinal 
content of turkey embryo infected with TCoV by a modified 


method using guanidinium thiocyanate and acid-phenol 
(Chomczynski and Sacchi, 1987; Akin et al., 1999). Con¬ 
version of total RNA to cDNA was essentially performed 
according to a protocol supplied by the manufacturer 
of the reverse transcriptase (Superscript II system. Life 
Technologies, Gaithersburg, MD). 

2.3. PCR amplification 

Three microliters of cDNA were used in PCR ampli¬ 
fications with the primers designed from IBV genomic 
sequences. The locations and sequences of primers for the 
amplification of 4 fragments I-IV for 3' end coding region 
of TCoV structural protein genes are outlined in Fig. 1 . PCR 
was performed with a mixture (64:1, v:v) of Taq (Promega 
Corp., Madison, WI) and Pfu polymerases (Stratagene, 
La Jolla, CA) in a 96-well thermal cycler (GeneAmp, 
Perkin-Elmer Cetus Corp., Norwalk, CT) (Barnes, 1994; 
Akin et al., 1999). The cyclic parameters of the PCR was as 
follows: 94 °C for 1 min for denaturation, 37 °C for 2 min for 
annealing, and 72 °C for 5 min for extension for 40 cycles 
followed by 72 °C for 10 min for final extension. 

2.4. Molecular cloning and sequencing 

One microliter of the amplification product was used to 
ligate with pCR-II plasmid vector according to the man¬ 
ufacturer’s instructions (Invitrogen, San Diego, CA). De¬ 
termination of the nucleotide sequences of the selected 
clone with amplified sequences was performed by dideoxy- 
cycle sequencing method with the corresponding sequenc¬ 
ing primers for both strands (DAVIS Sequencing, Davis, 
CA). 

2.5. Sequence analysis 

The nucleotide and deduced amino acid sequences be¬ 
tween the TCoV and other coronaviruses were analyzed by 
DNAstar program (Lasergene Corp, Madison, WI), respec¬ 
tively. Percent similarities were calculated to find nucleic 
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S gene S 

tcov t g a a a a {5tga"a"ca"a|a agacagactt agttttt ggttt agtt AAGAGT GGCAAGTTATTAGTTAGAG ATG TTGGCGCCGTT 80 

Beau .C ... A A.... A.T....T.C.G..A .TAA.ACC 80 

TCoV GTGCTGCTTGACGAT A AT C T T G T C AC T T GC AT G T A AT G - T G AG T T C T AT T C C C AC A AT T T G T A AC A AT GC T A C T G T AG GT 159 

Beau TCTT. TAC. AGT . . CTC. T . . . . GT GC ACT.G. . C. . TT. . G. . . GA. AGT . G. . CT. . CGTT . A.AC C A A- 159 

TCoV AGTGGTGAGTGTGT ACCTTGC A- CAGACAT- - AC C AG A G T AT G C AC A T G G T A A A A AT G G G G G T G A AG C T C T A G A T T T T T A 236 

Beau .... CCTTCA. ACC. ... A. TGGTT. G. . . TT. . A. . G. GG. . . TT. . . - CGGT. G. T A AC A. TT- - . . A GC. . A. . . A. 236 


TCoV T AGTCCAGATGTT - - ATGAGACCGC- C A G AT G GT GC AT AT AT AC A AT C C G GT T AT T AT G A A C C AC T T T T C AC AG GT T GC T 313 

Beau . . a. G. . . GCTC. TC. . C. • GGT. TA. T. T. . . . AT- . . . TC. TGG. GGTCG. G. . G. T. . T - G. . . C. TCT. - - . A. . . 312 

TCoV TT AATCAAACT AATCAAACT GAT AGCACCTGT GCAAAT GGACTTT AT AGAGGGTCGCCAGGT AATTTT ACT AT ACAAGGT 393 

Beau A. G. CGGC. . CG- . . . T- . A. G. . TGG. T. . GT. T. GCA. T. AG. T. T. T AC T G. A. ACT.- TC. G. T. CTAC 389 


TCoV GATTTTCTT GAAAATT AT GAT GCT ATTGGCAT AATGTTT - T G G T G G G GT C T A G G T AC T A A A G T G G GT C AT C A C G AT C C C A 472 
Beau AG. G. . TG. T.-C. CAT. . T. . . AAA. . . GG. . GG. G. CC. TTAACTGGCA. G. . TC. ACA. A AT C. T. TA- - . GT G 463 


TCoV 

Beau 


T. . C. GC. A. G. A. A AT . 


. C T A.- . . . C. G. 


. . . G. AG. 


GTACCCT.C.T. 


552 

536 


TCoV ATTGCCAC- T G G A G A T AT T T T C GT A A A T G G C G T T T T A A T G G G T G T T T AT A A T C T A A AT T T T AC G C A A A C C T T A A C A A T T T 631 
Beau . . . T. AGTG. . T T A. . . A. . . AAC. TCC. TATA.AT. ... A. CT. GT. TAC. CC. C. . ATG. G. . . A. . GATG. . A 616 


TCoV GGTTAGCACAGTGTGTTGGCACAATGAAGGTTGTTATTTTACGTAATAGTAATGCTCTAGTTCAGTTTTCAGCTGGCAAC 711 
Beau CA. C- - TG. . . -.- - T. TTT. A. . . C. G. . GG AC C. . TA- . C. TA. . . A. T. A. GAGAG. AG. . AA. . . CC- TGG. 689 


TCoV GT AGTTG ATTTTGAACCCTGT ACAGGAGAC ACT ACT AT- T A AT A AGT T AC GT T GT G- C T T A T C A G C A A T T T A AT T T T T C T 789 

Beau T. . T. . TG. . AATGGTA. . . C. . • A. . TGTTA. TT. G. G. G. . GG A. C. .C.A.A.G. ..G.T.... TGCC. G. A. A AT A 769 

TCoV ACGGGATTTT ATGAC AT AGAT-AC T T T T G T AC C T G T G AC A- TCTAAT- - AT T AC AT AT T T AC C - - T T A C C C A G A T T T A 861 

Beau CT. . C. A. . . T. C. G. . G. C. TTT. TCC. . T. . . T A A. AGT. G. T. . G. TA. GC. G. AG. . . . TTGTC. . T. GT. . A A AT 849 

TCoV AAAGATAATACTGGTCAAGAGGTATATGATTTTTATGTAGCTCTTAAAGGAGATTCTGTTAGTTATAATCAAAGTTGTGT 941 
Beau . GT. T.AC. TGTAC. T. . C. CA. ...C-..T.--T..A...T. AGAC. GGC. CC. ACCC.CT. . . G. . . . 926 

TCoV . . AGA_CTCT AACTACCC ATTCTT-T A A AT T A A A G T G - - TA- - AT A AT A C T T AT C T A T G G G AT C AG G A T A AG T T G T 1009 

Beau TC. . . ATAT. . A. . . . TA. . . AA. A A A A AC. GC. C. G. . . . GT. . TT.TT. A. . T. T. CCTT. . T. AG. . - . . . T. 1005 

TCoV GTATATTAGGGTCTGAAATATATGTGCCGGGGTATAAGGTTTATACGCAACATTCATATGCTGGCACTATACCGAATTAC 1089 
Beau . . T. . . A. . . AGTCT. . T. T.AT- . . AT C T. . TCACCC. AGTTGT. A. . . T. G. C- TA. AA.T A AT . . . GG. 1083 

TCoV CCA- - GCTTGTTCTTCC AAAGGTCTT AGTT- T G G A A A AT AT T T AT A A G A AT AT AG GT T T C A AT AG AT T T T GT G T A AC T A A 1166 

Beau TTGTG. T. . AA. T C A. TTTCA. . T. C. A. . GCTT. CGG. CC. CT. C. AGG.-. . . . G. . . GCA. . C-. . . CTT. . . 1157 


TCoV ATTGTTGGAGACTCGT- GATATA- T C AT CT AT A AC C C A AT AC AC T T G T GT T T AT - GT C GTT AGT GT AG AAGGT T T T G AT A 1243 
Beau . GGT AG A. CA. . . T. . T. T. . . GC. T. • TC. . . TGGAGG. C. TT C GC . . . G. . AA. GT. . . TA. TC. . GT. AG. . A. . . C 1237 

TCoV GTTATAATTGGGGTATTATATACAGGTATGTTGGTGTAAACTTTGATTTTGGTACAGCTATGTATAGTGTGAAAAGTGCA 1323 
Beau A. A. . TT. G A A T . . GGAC . G. TAGTT.AC. A AG. G. GG. . GC . C. C. . . T. CAA. CAGCC. C. . AACCGCCA. TT 1317 


TCoV CCT ATGC AGTTCATCTAT A- C AG AGC A AT GT C A- - T A AT T AT A AT AT T T AC A A C AT A AG AG GT G-T T G GT C A A AT AT T 1397 

Beau ATA. CT. . AAA. . AT. . . . AT. ATATT. CT. T. AA. . C. . G. GT. GA. . . T. . T. . . TAT. . CAGAAC. . . C. . . GGT. . 1397 


TCoV 

Beau 


AA. GTGACCG. CT. AG. TG. . AGTT. T A A. TA.GACG. AG. . TTGGC. AT. 


GATA. A. C. G 


1477 

1474 


TCoV GT AAT AAT GGA AGT C TT T AT AGT G T AC A AC C G T GC C AG AC A G T T T C A AC T - - C A A GC T GT A AT T GT T A AC GG G A GT T T T G 1555 
Beau . . TCC. - . A. . CA.G. - -.G- . . . A- . T. TG. . C. T. . T. ATT. T A A G. . T. ACCC. TG. . A AG A. G. CA 1549 


TCoV T AGGCTT GTTTCTTCCT GCATCTT GT AGGTT AGCCGGACAATTTGGT CT AGGT AACCACACAGAAT ATGT AGAT GGTGGT 1635 
Beau ACCAGCA. . . . G. AGT. T. TGG. G. . . AA. . . . TA. . TATTC. . AC. TC. C. . . . TG. G. . T. GT. C. CAGCT. CT. . AG 1629 


TCoV T GT CTT ATT AGT AAT AAGACT AGT ACT AGAGAGCGT AGGT CT ACT AACGCT GTTT AT ACT GGT GAAT GC- ACAGGTT - TA 1713 

Beau AAC. AGT. . TAC. TC. . A. TC. C. . A. G. . ACA. . . C. T. T . . - - G. . . T. C. A. -.A- A. . . . TTG. . AA. . GCC 1705 

TCoV TTTGC-AGC T AT G G G T AC AT C T T G T AT T T AT AG T A AT G G C AC AGT T C T T A AT A G A AC AT T AC C AC A AC C A T C T G T T C A 1790 

Beau C. . AT GTT. . T.- . . AG. T.AA. ACC. G.T. . A-. . GCC. C..T.G.A. ..A.. TG. AA. . 1781 

TCoV AACTAGTGTGCAACCATTACTAGGTGTTGTGGCAAATGTGTCTATTCCAAAAGAATTGACACTTGCTGTAACCACAGAAT 1870 

Beau GTT.-... GC. . . T. . . T. T A A. . . . ACT. A.CTC. . A. . T. . CAGT. . C. ACT. AA. . . . T. . AGAT. . G. 1858 

TCoV ACTTACAAACTCGTTATCAGAAAGTTACTATTGATTGTGCTAAGTATGTCTGTGGTGATTCTATAAGATGCCGCACTCTG 1950 

Beau . . A.G. . . AT G G. T. . G. . CCAA. . . A. . . . CCTGC.T.CAG. . . . C. GG AT. . TA. A. AGT. . 1938 


Fig. 2. Nucleotide sequence of the amplified fragments containing entire spike (S) protein gene, gene 3, membrane (M) protein gene, gene 5, and nucleocapsid 
protein gene region of turkey coronavirus (TCoV) and their similarity to those of infectious bronchitis (IBV) strain Beaudette (GenBank accession number 
AJ311317). The positions where nucleotide bases are missing are indicated as (-) and identical nucleotides as (.). Heavy underlines below the sequence of 
TCoV indicate the putative start codons. Light lines above the sequence of TCoV indicate the stop codons. The conserved tanscription-regulating nucleotide 
sequence (A/C)T(T/G)AACAA, which is located upstream from the start codons of individual genes, is boxed. The start codon of IBV M protein gene is also 
underlined because it is at different position from that of TCoV M protein gene. 
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TCoV 

Beau 


CTGCAGCAGTATGGTTCTTTTTGCCAGAGTGTTAATG- CTATATTAAGTGGTGTTAATGAGAATGAAGATAATGGTTTAC 
T. T. . A. . A.GC. . G.G. C. ACA. -.T..T..G..G...A.A .GG. C. A. . A.TG. AAC. TT 


2029 

2017 


TCoV 

Beau 


T GCAAT AT GCGGAAGCT ATT AATACGGGTT ACACACTT AATTTTACT GGTTAT - AATGT ATCTAATTTGGGTGGGTTT GA 
. . A. T. TCTATTCTT. . .C...-..C..C.G -GT. . T. A. AC. . CA. . . C. T. G. AATGT. . GCACT. ... A. ... A. 


2108 

2093 


TCoV 

Beau 


T. CT. G. A- 


CT.C. T. . A. . . ACA. 


2188 

2164 


TCoV 

Beau 


AAGTTGTTACTCTAGGTGTTAGTGAAGTTGATCAGAATTATGATAAATGTATAGCATCTAGAGGCGGTTCTTTTACTAAT 
GC. . . . A AT. . G. T. . - AC. . CCA. CA- - A. . G. CGCA. . . A. A. . T. . C. CT. . .-. . - - ACCT. . AGG. . TT. 


2268 

2233 


TCoV 

Beau 


TTAGCAGATTTAACTTGTGCTCAGTTTTACAATGGCGTGATGGTGCTCCCTGGTGTTGTTGATCCTGGCCTAATGGCGGT 
. . . A- G. . CC. TG. G.GT GA A. . T.TT. .C.T...T.G... CC. A. CA. AACAG. A. AAA. GCAA. . TT. 


2348 

2312 


TCoV 

Beau 


GT ACACCGGTTCTTT AAT AGGTGGT ATGGCGCTTGGAGGCATT ACAGCAGCGGCATCCAT ACCATTTGCT ACTCAACTAC 
. . . T . . TA.C. . G. . . C. TC.TT. . . . T . . T.T.T. GTG. T.T.C. . A.G. 


2428 

2392 


TCoV 

Beau 


AAGCACGT GTCAATT ATTT AGCGTTGACACAATCGGTGCTTTT AGAT AACCAAAATTTGAT AGCAAATTCTTTT AAT AAA 
. G. . TA. AA. T. . . C. C. . G. G T A. T. . C. . G. . AC. TT. G. . GA. G. . T. . . G. A A A A. . T. . TGC. . . C.G 


2508 

2472 


TCoV 

Beau 


GTCCTTAAAGGTATACAGTCAGCACTAGACACTGTGTCCCAGGGATTTGTTGAAGTAGCTAGAGGTTTTGAGAGTGTAAC 
. C. A. .-G. . . .-. A. .- .... A.--AGA. . . AC. T. 


2588 

2510 


TCoV 

Beau 


T GT AGCACTT AAT AAAGTCCAGGATGT AGTT AAT ACACATTCTGACAT ATT AAAT AAACTT ATGGCTCAGCTT AGT GTT A 
. C.T. AC. AC. . A. T. . A.T. . . . G. . A. . . GAG. . CT. . TC. T. C. G. GAC.ATCA. ... A. AAA. 


2668 

2590 


TCoV 

Beau 


ATTTT GGT GCTGTTTCATCTTCTCTT AAT GAGAT AT ACCT AAAATT AGGCCAGATT AAT GCAGATGCTCAGGTT GAT AGG 
.A. . . . T. . . GTGA. . C. A. . A. . C. . . . A G C. . . . T. A. GCC. . AC. A. . . A.A. . G. . . C. T 


2748 

2670 


TCoV 

Beau 


TTAATTACTGGTAGGTTGACTGCATTATCAACTTACGTAGCATCCCTTCAATTAGCAGCTTATA- - AGGCTGATCAGTCA 
C. T. . A.A. . . T. AT. . C. T. . TGT. ...TGC. A. GCAG. . G. AG. . . . TT. . AG. . TCACAA. . 


2826 

2747 


TCoV 

Beau 


AGACGC- TTGGCTCT ACAGAAAGT AGAGGAGT GTGTT AAATCGCAAT CT ATGCGAT ATGGTTTCTGTGGCAATGGT AGCC 
GCGT. AG. . A. . . ACT.A. TA. T.G. . A. . G.TA. G. . CTCC. . T.T.AC. A. 


2905 

2827 


TCoV 

Beau 


ATGTTCTAACAATACCACAGAGTGCTCCTAATGGAATATTCTTCATCCACTACACGTACCAACCCACTGCTTATGTTACT 
.C.G. . A. A. . . A. T...G.G..T..A.... TTT. T. . TACT. . AGA. AG. . T.A. 


2985 

2907 


TCoV 

Beau 


ATTGAAGCAGTGCCTGGTCT ATGTGT AACAACACCTT-CTGGAAAGTATGGTGTAATGCCCAAGACTGGAAGTGGCAT 

G. . ACT. . . A. AGT G. . . T. T.AGC. . G. . A AT G. . A. TC.CAA. . G.GC T . A. ..T..G..T.. 


3062 

2987 


TCoV 

Beau 


CAT ATTT AGACAGAAT GGT ACTTTCTTTGTCACTTCT ACACAGTT AT ATGAGCCT A- AGCTTTT AT CAT ATTCAGAAGT G 
TT. TA. ACA. GTT.G. . A. . ACA.G. AC G. G. TA. G. . . AT. ..A.G....A... CTGC. GG- . . . T. . A 


3141 

3066 


TCoV 

Beau 


GTTAATTTAACATCTTGTGAAGCTAATTATTACAATGTTAGTGAAAGTGAAACACCCTTTCAACCAGCATTGCCTAATTT 
.... CGC. T. . T.C. . . . A... . A. . . . T. . A. A. G. . CGT. ATG. . GA- - 


3221 

3134 


TCoV 

Beau 


CGATGATGAGTTTAATGACATTTTTACTGATTTAAATATCTCAAAAGATGCCATTAAAAATATCAGTTCCAATTTTAACT 
.A.T. . . G. .-. . . . A. . . C- - G. . . - . G.- - . . GT GG- . . TG. . . CT.-. GCA. G. G. . 


3301 

3197 


TCoV 

Beau 


ACACAATTCCCATCCT AGATCTTCAGGGTGAGATTTGT AGATTGAACCATTCCATT AACACACT AT AT AATT AT AGTGAT 
. . CAG. C. TTG. C-. AA-. . AC. C. G. - . C. T. .A. - -. . G. . . T - - . G-. . . G. G.A. T. . . . 


3381 

3258 


TCoV 

Beau 


GTTATTGACAAAATTAACCAGGGTCTTAATGACACCTTTATTAATCTTGAACAACTTAATAAAATTACACGTTACATTAA 
CG. . . . C. AGGCG. . . TA.T. TC. A. . AG. C.A.T C A. T. C. C. A. AC. . . T. 


3461 

3338 


TCoV 

Beau 


GTGGCCTT GGT AT GTTT GGTT AGCAATT GGTT CT GCTT GTTT AAT ATT CATCTT AAT ATT AGGATGGGTTTTTTT CAT GA 
.G.C. . A. C. . T. . . CAC. A. T. . C.C.C. 


3541 

3418 


TCoV 

Beau 


TCoV 

Beau 


CTGGTTGTTGTGGTTGTTGTTGTGGATGCTTTGGCATTATTCCGCTAATGAGTAAGTGTGGTAAGAAATCCTCTTACTAC 

. G..T.T.T... 

Gene 3 _3a 

ACGACCTTTGATAATGATGTGGTGT|ATGAACA A| A ACAGACCTAAAAAGTCTGTTTA ATGA TTCAAACTCCAACATCTTTT 
.T.C.A AC.T.G.C. . . . G. . . C. . G. . C. . C 


3621 

3498 


3701 

3578 


TCoV 

Beau 


TCoV 

Beau 


_ 3b 

ACAAGTTTT ACTCCAAATT ATT AAT AGT AACTT ACAGTCT AGACTTCT GCTTTGGCACAGT CT AGACT AATGTT AGATTT 
.C.GACC.. . . . . . A. C. . 


3781 

3658 


3861 

3738 


TCoV 

Beau 


TGAAGCAATTATTGAAACTGGTCAGCAAATAATTCAACAAATCAGTTTCGATTTGCAACACATTTCAAGTGTGCTAAGCA 
A. . . . T.G.G. G.A.A. . . . A. . G. . T.AT. . . A. . 


3941 

3818 


Fig. 2. ( Continued ) 
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TCoV 
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TCoV 
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TCoV 
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TCoV 
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TCoV 
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TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 

TCoV 

Beau 


CT GAACT ATTTGACCCCTTTGAAGTTTGTGTTT ACAGAGGAGGT AATT ATTGGGAGTT AGAGT CAGCT GACGAGTTTT CA 4021 
.A...G.T.CTA....TA.T.AA.A..T.G.... 3898 

3c _ 

GGT G AT G ACGAAT AT ATTGAGT AAATCGCT AGAGGAGAACGGAAGTTTCCT AACAGCAGTTT ACAT ATTT ATTGGATTTT 4101 
. T....T.A...G.T.T.GC.A..G.A . 3978 

TAGCACTTTACCTATTAGGTAGAGCACTTCAAACATTTGTACAGGCTGCTGATGCTTGTTGTTTGTTTTGGTATACATGG 4181 

. T..TC .G.A. 4058 

M gene 

GTAGTAGTTCCTGGAGCTAAGGGTGCAGCCTTTGTATACAATTATACATATGGTAAAAAA £ T T A AC A a|a CCGGAGTTAGA 4261 
.A. ...A .A.G.G.T.A. 4138 

M 

AGCGGTT ATT GTT AACGAGTT CCCT AAGAACGGTT GGAAT AGT-C C AG C A A AT T T C C G A C G T G- AT G G AA- - 4329 

. . . A.T.A. A A A A AT.T. A. G A. . CCCA. C. AG. CA 4218 

AATT GCACACT TA"AC TCAGAGCAGGCAGTTCT GCTTTTT AAAGAAT AT AATTT ATTT AT AACCGCATTCCT ATT GTTTTT 4409 
. T..T...G...TT..A...T .A.G.T. T.G . C.. 4298 

AACCAT ACT ACTTCAGT AT GGGT ACGCAACT AGGAGTCGGGTT ATTT ACAT ACTGAAAATGAT AGT GTT AT GGT GCTTTT 4489 
.A. C..T.A .. A... A A. T.C . 4378 

GGCCCATTAACATTGCAGTAGGTGTAATTTCATGTGTATATCCACCAAATACAGGAGGTCTTGTCGCAGCGATAATACTT 4569 
.C. AC...C .C. 4458 

ACTGTGTTTGCTTGTCTTTCTTTTATAGGTTATTGGATCCAGAGTATCAGACTCTTTAAAAGGTGTAAGTCTTGGTGGTC 4649 
..A.G.G.G.T.GC. G...A . 4538 

ATTTAACCCCGAATCTAATTCCGTAGGTTCAATACTCCTAACTAATGGTCAACAATGCAATTTTGCTATAGAGAGTGTGC 4729 
. T..A .G.T. 4618 

CTATGGTGCTTTCCCCTATTATAAAGAATGGTGTCCTTTATTGTGAGGGTCAGTGGCTTGCTAAGTGTGAACCAGACCAC 4809 
.A. T..A .T. 4698 

TT GCCT AAAGAT AT ATTTGTGT GCACACCGGAT AGACGT AAT ATCT ATCGT AT GGT GCAGAAAT ACACTGGT GATCAAAG 4889 
. T..T .C.T.C. 4778 

CGGAAATAAGAAAAGGTTTGCTACATTTGTCTATGCAAAGCAGTCAGTAGACACCGGCGAGCTAGAAAGTGTAGCAACGT 4969 
.G. T..T .AG 4858 

CTGGAAGT AGCCTTT ACACATAAATGTGTGTGTGT AGAGAGT ATTT AAAATT ATTCTTCAAT AGTGCCTCT ATTTT AAGA 5049 
GA.T.T.C.G. 4938 

GCGCGGAAGAGT ATTTGTTTTGAGGATATT AAT AT AAATCCTTTTTGTTTTGCACTCTCTTTGCAAGAGTT ATT ATTT AA 5129 
. ... AT.A.C.C.AT.C. . T.C. . . .- 5012 

GCAACAGTTTTTCCTTTCTTTTGTTTGGAAGAAAGTTGTTGTTAATGGTGTAGAATTCCAAGTAGAAAATGGAAAAGTCC 5209 
--...G..G..A ---T.T 5049 

ACT ACGAAGGAAATCCAATCTTCCAAAAAGGTTGTTGT AGTTTGTGGTCCGATT AT AAGAAAGATT AGAAT AATT AAGCC 5289 
.CA. . . G. T.GA.A...G. A. . 5124 

Gene 5 

ACC AACTACACTT ATTTTT AAAAAGGGCGTTTT AGCTT AT AAATC|CTT AACA4AT ACGGACG ATG AAATGGCTGACT AGT 5369 
...T.T..GA..T.CT.A.C...AA . 5204 

TTTGGAAGAGCGGTTGTTTCATGTTACAAAGCACTACTATTAACTCAACTTAGAGTGTTAGATAGGTTAATTTTAGATCA 5449 
. A... A.... T.T...T.C . 5284 

CGGACCAAAACGCGTTTTAACGTGTAGTAGGCGAGTGCTTTTGTTTCAGTTAGATTTAGTTTATAGGTTGGCGTTTACGC 5529 
. T.CT.C.AG... .A .A. 5364 

5b_ 

CCACCCAACCGCTGGT ATG A AT A A-AGATAATCCTTTTCGCAGAGCAATAGCAAGAAAAGCGCGAATTTATCTGA 5603 

.T.C.T AGT A A.G.T. 5444 

N gene _ 

GAGAAGGATT AGATTGT GTTT ACTTT1C TT AACA4AGCAGGACAAGCAGATCCTTGCCCCGAGTGTACCTCTCTGGT ATTC 5683 
. . .G. T....C .A. 5524 

N 

CGAGGGAAAATTTGTGAGGAACACATAAATAATAATAATCTTTTGTC ATG GCAAGCGGTAAGGCTACTGGAAAAACAGAC 5763 
.A.C.C.A. . AG. 5604 

GCTCCGGCGCC AGTC AT C A A AC TTg G AG G AC C A A AG C C AC C T A A AGT T G GT T C T T CTGGAAGT G T G T C T T GG T T T C A AGC 5843 
. . C . . A.T.A.C. A..CA . 5684 

AATAAAAGCCAAGAAGCTAAATTCACCTCAGCCTAAGTTTGAAGGTAGCGGTGTTCCTGATAATGAAAATCTTAAAACAA 5923 
.T.A. C...C.C.CA....GC... 5764 


Fig. 2. ( Continued) 
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TCoV GCCAGCAACATGGGTATTGGAGACGCCAAGCTAGGTTTAAGTCAAGTAGAGGAGGAAGAAAACCAGTCCCAGATGCCTGG 6003 

Beau . A..C.C.C..G.C.A...T.T... 5844 

TCoV T ACTTCT ATT AT ACTGGAACAGGACCAGCCGCT GACCTGCAATGGGGTGAT AGCCAAGCGGGT AT AGT GT GGGTTGCTGC 6083 
Beau . T..C.T.A.C.CT....AT. 5924 

TCoV AAAGGGTGCTGATGTTAAATCTAAATCTAACCAGGGTACAAGGGACCCTGACAAGTTTGACCAATATCCACTACGTTTCT 6163 

Beau T. AC .G...C..T.A..T.T.C.A.... 6004 

TCoV CTGATGGTGGACCTGATGGTAATTTCCGTTGGGATTTCATTCCTTTGAATCGTGGTAGGAGTGGAAGATCAACAGCAGTT 6243 

Beau .G.C.C.CC....C.C. 6084 

TCoV TCATCAGCAGCATCTATTAGAGCACCATCACGTGAAGGTTCGCGTGGTCGTAGAAGTGGTGCTGAAGATGATCTTATTGC 6323 

Beau . G...G.A.T...G.C. 6164 

TCoV TCGCGCAGCAAAGATAATTCAGGATCAGCAGAAGAGGGGTTCTCGCATTACCAAGGCTAAGGCTGAAGAAATGGCCCATC 6403 

Beau ...T.C.A.A...C.A.A..T.T.... 6244 

TCoV GCCGGT ACTGC AAGCGC ACTGTCCC ACCTGGTT AT AAGGTTGAGC AAGTTTTTGGTCCCCGT ACT AAAGGTAAGGAGGGA 6483 

Beau .T.A.AA.G.T.G.G 6324 

TCoV AATTTTGGTGAT GACAAGAT GAAT GAGGAAGGT ATT AAGGATGGGCGTGTT ACAGCAAT GCTCAACCT AGTCCCT AGCAG 6563 

Beau . 6404 

TCoV CCATGCTTGTCTTTTTGCAAGTAAATTGACGCCCAAACTTCAACCAGATGGGCTGCACTTGAAATTTGAATTTACTACTG 6643 

Beau .G. G.G....A .T.T.G. 6484 

TCoV TGGTCCCACGTGATGATCCGCAGTTTGATAATTATGTTAGTATTTGTGATCAGTGTGTCGATGGTGTAGGAACACGTCCA 6723 

Beau .T.C.G. AA.G. 6564 

TCoV AAAGATGACGAACCGAGACCAAAGTCACGCGCAAGTTCAAGACCTGCTACAAGAGGAAATTCTCCAGCTCCAAGACAACA 6803 

Beau .A. A.T.G. 6644 

TCoV GCGTCT AAAGAAGGAGAAAAAGCCAAAGAAGCAGGAT GATGAAGT AGAT AAAGCATT GACCTCAGAT GAGGAGAGGAACA 6883 

Beau ...C.C .T.C. 6724 

TCoV ATGCACAGCTGGAATTTGATGATGAACCCAAGGTGATTAACTGGGGGGATTCAGCTCTAGGGGAGAATGAACTTTGAGTA 6963 

Beau .T.G.A.G. 6804 

Fig. 2. ( Continued ). 


acid and amino acid pair distances. Based on the obtained 
sequences of TCoV and previously published sequences 
of different coronaviruses, phylogenetic trees were con¬ 
structed according to the coding sequences for S, M, and N 
genes. 

3. Results 

3.1. Complete nucleotide sequences of 3' end coding 
region for structural protein genes of turkey coronavirus 

Cloning and sequencing of the 4 overlapping fragments re¬ 
vealed a total of 6963 nucleotides in a region containing the 
entire S protein gene, tricistronic gene 3, M protein gene, bi- 
cistronic gene 5, and N protein gene of TCoV in the present 
study. The primary structures of the coding sequences for 
these genes of TCoV in the present study were very sim¬ 
ilar to those found in the corresponding genomic regions 
of IBV strain Beaudette as shown in Fig. 2 and Table 1. 
The canonical consensus transcription-regulating sequence 
(TRS) of IBV, CT(T/G)AACAA, was also found in TCoV 
in the present study. Both the nucleotide sequence of the 
TRS and the distance between the 3' end of the TRS and 
the initiation codon of the downstream adjacent ORF were 
highly conserved between TCoV in the present study and 
IBV (Table 1). 


Table 1 

Comparison of the 3' end encoding regions between turkey coronavirus 
(TCoV) and infectious bronchitis virus (IBV) strain Beaudette (GenBank 
accession number AJ311317) 


Virus 

Gene 

ORF a size 
(nucleotides) 

TRS b 

sequence 

RS distance 0 
(nucleotides) 

TCoV 

Spike 

3612 

ctgaacaa 

52 


Gene 3 


atgaacaa 

23 


3a 

174 




3b 

195 




3c 

318 




Membrane 

669 

cttaacaa 

74 


Gene 5 


cttaacaa 

9 


5a 

198 




5b 

243 




Nucleocapsid 

1230 

cttaacaa 

93 

IBV 

Spike 

3489 

ctgaacaa 

52 


Gene 3 


ctgaacaa 

23 


3a 

174 




3b 

195 




3c 

330 




Membrane 

678 

cttaacaa 

77 


Gene 5 


cttaacaa 

9 


5a 

198 




5b 

249 




Nucleocapsid 

1230 

cttaacaa 

93 


a ORF: open reading frame. 
b TRS: transcription-regulating sequence. 

c The distance is calculated as nucleotides between 3’ end of TRS and 


the ATG start codon of the corresponding first downstream ORF. 























Table 2 

Sequence pair distances for nucleic acid and deduced amino acid sequence of the entire spike protein gene region of turkey coronavirus (TCoV) with other coronaviruses 
Nucleotide identity (%) 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

1 TCoV a 

100 

95.1 

95.4 

51.7 

52.5 

52.1 

42.9 

41.8 

41.8 

38.3 

42.6 

42.8 

41.1 

38.8 

41.3 

39.3 

2 TCoV-Gh b 

90.2 

100 

96.8 

50.6 

51.3 

51.0 

42.2 

41.4 

41.3 

38.1 

41.9 

42.2 

40.2 

38.1 

41.0 

38.4 

3 TCoV-GI c 

91.1 

95.1 

100 

50.3 

51.0 

50.7 

42.0 

41.3 

41.3 

37.9 

41.8 

42.1 

40.1 

38.3 

41.0 

38.2 

4 IBV-CU d 

33.9 

32.8 

32.7 

100 

85.3 

85.6 

41.7 

39.9 

39.7 

38.6 

41.5 

41.8 

40.6 

38.6 

40.3 

39.5 

5 IBV-KB e 

33.8 

33.1 

32.7 

83.2 

100 

94.4 

42.6 

40.7 

40.6 

38.1 

42.2 

42.7 

41.1 

39.3 

41.2 

40.6 

6 IBV-Beau f 

33.9 

33.1 

32.7 

84.6 

94.0 

100 

42.8 

40.7 

40.7 

38.1 

42.3 

42.9 

41.6 

39.3 

41.1 

40.4 

7 BCoV® 

22.3 

21.8 

21.8 

20.4 

21.3 

20.7 

100 

38.2 

38.6 

33.2 

93.9 

98.7 

67.1 

36.1 

38.7 

38.6 

8 CCoV 1 ’ 

25.4 

24.0 

24.5 

23.8 

23.2 

23.3 

21.3 

100 

90.4 

44.5 

35.5 

35.9 

35.6 

50.3 

82.7 

33.7 

9 FECoV 1 

25.2 

23.9 

24.4 

24.0 

23.3 

23.5 

21.3 

93.1 

100 

44.1 

36.3 

36.2 

36.1 

50.9 

84.3 

33.7 

10 HCoV229Ei 

25.9 

25.2 

25.6 

24.7 

24.8 

24.5 

18.4 

38.8 

39.1 

100 

38.0 

38.5 

37.8 

57.5 

55.3 

36.5 

11 HCoVOC43 k 

22.3 

21.8 

21.7 

20.3 

21.2 

20.7 

91.9 

20.2 

20.1 

21.0 

100 

94.1 

67.0 

36.2 

38.6 

38.2 

12 HECoV 1 

22.3 

21.6 

21.7 

20.3 

21.2 

20.6 

97.9 

20.0 

19.8 

21.4 

92.2 

100 

67.0 

36.2 

38.6 

38.5 

13 MHV m 

22.3 

22.1 

21.7 

21.1 

21.8 

21.2 

65.7 

19.5 

19.5 

22.1 

65.4 

65.9 

100 

35.8 

38.4 

37.2 

14 PEDV" 

26.4 

25.4 

25.9 

25.1 

25.4 

25.5 

19.8 

42.6 

43.3 

49.7 

20.2 

20.2 

20.9 

100 

52.6 

32.8 

15 TGEV° 

25.0 

23.6 

24.2 

24.1 

23.5 

23.6 

21.3 

80.5 

81.5 

48.0 

21.6 

21.4 

21.2 

44.8 

100 

33.8 

16 SARSP 

19.5 

18.6 

18.8 

21.6 

21.1 

21.4 

22.6 

18.2 

18.3 

22.1 

22.4 

22.7 

22.3 

19.7 

17.9 

100 


Amino acid identity (%) 
a TCoV: a US, Indiana, isolate of TCoV. 

b TCoV-Gh: an isolate of TCoV. GenBank accession number AY342356. 
c TCoV-GI: an isolate of TCoV. GenBank accession number AY342357. 

d IBV-CU: a German strain, CU-T2, of infectious bronchitis virus (IBV). GenBank accession number U49858. 
e IBV-KB: a Japanese strain, KB8523, of IBV. GenBank accession number M21515. 
f IBV-Beau: a US strain, Beaudette, of IBV. GenBank accession number AJ311317. 
g BCoV: bovine coronavirus. GenBank accession number M64668. 
h CCoV: canine coronavirus. GenBank accession number X77047. 

1 FECoV: feline enteric coronavirus. GenBank accession number X80799. 

J HCoV229E: human coronavirus strain 229E. GenBank accession number AF344186. 
k HCoVOC43: human coronavirus strain OC43. GenBank accession number LI4643. 

1 HECoV: human enteric coronavirus. GenBank accession number L07748. 
m MHV: murine hepatitis coronavirus. GenBank accession number U72635. 
n PEDV: porcine epidemic diarrhea coronavirus. GenBank accession number Z25483. 

° TGEV: porcine transmissible gastroenteritis coronavirus. GenBank accession number AJ271965. 
p SARS: severe and acute respiratory syndrome coronavirus. GenBank accession number NC_004718. 
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3.2. Sequence comparison and phylogenetic analysis 

Pair-wise comparison of nucleotide and deduced amino 
acid sequence distance between TCoV S protein gene in the 
present study and the homologous gene sequences of other 
known coronaviruses is summarized in Table 2. The simi¬ 
larity score between TCoV in the present study and other 
non-TCoV coronaviruses within the S protein gene region 
ranged from 38.3% to 52.5% at the nucleitide level or from 
19.5% to 33.9% at the amino acid level. 

The similarity scores between TCoV in the present study 
and IBV strains within the M or N protein gene region were 
high (>80%). In contrast, the similarity score between TCoV 
in the present study and other mammalian coronaviruses 
within the M or N protein gene region ranged from 24.8% to 
30.8% for nucleotide sequence and from 16.9% to 29.1% for 
deduced amino acid sequence. The tricistronic gene 3 with 3 
overlapping ORFs, 3a-c, in between S and M genes as well 
as the dicistronic gene 5 with 2 overlapping ORFs, 5a and 
5b, in between M and N genes of TCoV in the present study 
all shared high similarity with the corresponding genomic 
sequences of IBV strains. 

Phylogenetic analysis according to the deduced amino 
acid sequence of S, M, or N proteins indicated that TCoV in 
the present study was clustered within the same genomic lin¬ 
eage as the IBV strains while all the other mammalian coro¬ 
naviruses were grouped into separate clusters corresponding 
to antigenic groups I and II (data not shown). 

4. Discussion 

Total 6963 nucleotides of TCoV genome were cloned and 
sequenced in the present study. This region is likely to include 
all of the viral genes excluding the polymerase gene and, thus, 
provides substantial genetic information of the virus for com¬ 
parison with other coronaviruses. The genomic structures of 
ORFs for S protein, 3a-c, M protein, 5a-b, and N protein were 
very similar to those of IBV. The phylogenetic analysis based 
on the deduced amino acid sequences of S, M, or N protein 
all showed that the TCoV in the present study was classified 
within the same genomic lineage with IBV strains while all 
the other mammalian coronaviruses including BCoV were 
grouped to separate clusters. The nucleotide sequences of 
ORFs for 3a-c, M protein, 5a-b, and N protein of TCoV 
shared high similarity (82.1-92.0%) with the correspond¬ 
ing sequences of IBV. These results clearly demonstrated the 
close relationship of TCoV in the present study to avian IBV. 

The presence of tricistronic gene 3 in between 3' end of 
S gene and 5' end of M gene as well as the presence of di¬ 
cistronic gene 5 in between 3' end of M gene and 5' end of N 
gene are particular features of avian coronaviruses, TCoV and 
IBV. These particular genomic structures are not found in any 
other mammalian coronaviruses as determined to date. These 
distinct features of genome structure implied that TCoV 
shares a relatively close evolutionary relationship with IBV. 


The predicted proteins of ORF 3a-c, 5a and 5b were 
small (about or less than 10 kd). The functions of these 
gene products are not known. Several ORFs encoding non- 
structural proteins have been recognized in coronavirus 
genomes (Boursnell et al., 1987; Lee et al., 1991; Herold et al., 
1993; Eleouetetal., 1995). The number, nucleotide sequence, 
and gene order of these ORFs varied remarkably among dif¬ 
ferent coronaviruses. It is speculated that these genes were 
inserted into different sites in the coronavirus genomes due 
to the RNA recombination-prone discontinuous transcrip¬ 
tion mechanism and were not essential for virus replication 
and pathogenesis. However, sequence analysis in the present 
study indicated that both nucleotide sequences and locations 
of these ORFs and their consensus TRS of TCoV are highly 
conserved with those of IBV. Given such a highly conserved 
sequences and structures within avian coronaviruses, genes 3 
and 5 may play important roles in coronavirus pathogenesis 
to avian species. 

One of the characteristic features for coronavirus repli¬ 
cation is the synthesis of a 3' coterminal nested set of 
polycistronic subgenomic mRNAs by a discontinuous tran¬ 
scription mechanism. Several conserved TRS have been 
identified for different coronavirus proximal to the initi¬ 
ation codon of the first ORF for each particular subge¬ 
nomic mRNA. The consensus sequences of the TRS sites 
are CT(T/G)AACAA for IBV, ATC(T/C)AAAC for BCoV, 
AACTAAAC for TGEV, AATC(T/C)A(A/T)AC for MHV, 
and AACTAAAC for FIPV (Spaan et al., 1988; Stirrups 
et al., 2000). The distance between the TRS and the first ORF 
is different for each subgenomic mRNA of different coron¬ 
aviruses. Both the nucleotide sequence of TRS and the dis¬ 
tance between the TRS 3' end and the initiation codon of first 
ORF are suggested to play important role in the transcription 
of mRNAs. As shown in the present study, the TRS sequences 
of TCoV were found highly conserved with those of the corre¬ 
sponding genes of IBV except one nucleotide substitution in 
that of gene 3. The TRS of gene 3 is ATGAACAA for TCoV 
and CTGAACAA for IBV. The distances between TRS and 
initiation codon of S gene, gene 3, gene 5, and N gene of 
TCoV were all the same as those of IBV while the distances 
for TCoV or IBV M gene are 74 or 77, respectively. The 
highly conserved sequences and structures of TRS between 
TCoV and IBV provide further evidence that these two avian 
coronaviruses share close evolutionary relationship. These 
highly conserved TRS sequences of IBV has been shown to 
be recombination “hot spot” and may serve as the template 
switching sites for the viral encoded RNA dependent RNA 
polymerase (Lee and Jackwood, 2000). These recombina¬ 
tion events play important role to the emergence of new IBV 
variants responsible for continuous outbreaks in the chicken 
flocks vaccinated with live attenuated viruses due to failure of 
cross protection. It is possible that the similar recombination 
events of IBV in chicken may contribute to the origin and 
evolution of TCoV in turkey and merit further investigation. 

Even though the close genetic relationship between TCoV 
and IBV was clearly demonstrated as discussed above, 
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these two avian coronaviruses are dramatically different at 
the S protein gene level. The similarity of S protein se¬ 
quences between TCoV in the present study and IBV strains 
(33.8-33.9%) is much lower than that among IBV strains 
(83.2-94.0%). The difference of nucleotide between TCoV 
in the present study and IBV seems to be randomly distributed 
throughout the entire S gene except a stretch of 225 nu¬ 
cleotides from the 3' end that shared high similarity (88.9%) 
with the corresponding sequences of IBV. These observations 
suggested that cross-over homologous recombination, very 
likely by a template switching mechanism, occurred around 
the consensus TRS site of S gene and within the 3' end 225 
nucleotides region (involving the TRS site of gene 3) and 
resulted in a whole new codon reading frame for S protein 
of TCoV with conserved TRS and other genomic structure 
features of IBV. Spike protein of coronaviruses has been well 
known as the major structural protein responsible for attach¬ 
ment, fusion, and penetration of virions to the target cells. 
The substantial difference of S protein gene between TCoV 
and IBV well explains the different host tropism and different 
tissue pathogenicity of these two avian coronaviruses. Turkey 
coronavirus is associated with enteric disease of turkey while 
IBV is usually associated with respiratory disease in chicken. 

Two group-specific monoclonal antibodies, which reacted 
with a broad spectrum of homologous and heterologous IBV 
serotypes, were tested for reactivity with TCoV in a previous 
study (Loa et al., 2000). The antibody specific to M protein 
(Mab 919) of IBV had strong cross reactivity with TCoV but 
the antibody specific to S protein (Mab 94) of IBV did not 
react with TCoV. In line with these previous observations of 
antigenicity, the sequence analysis in the present study re¬ 
vealed a high homology of M protein gene between TCoV 
in the present study and IBV. On the other hand, the differ¬ 
ence of S protein gene between TCoV in the present study 
and IBV is substantial. Therefore, molecular diagnostic as¬ 
say or antigenic analysis using antibody specific to S pro¬ 
tein or gene will be useful tools to differentiate TCoV from 
IBV. 

The results of sequence analysis in the present study stress 
the close relationship of TCoV to IBV. Coronavirus genomes 
are dynamic with high frequency of recombination, insertion, 
and deletion, subsequently, may result in significant genetic 
differences. Further cloning and sequencing analysis of full- 
length genomic sequences of more TCoV isolates are under 
way for revealing a faithful picture of the TCoV genome. 
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